RAW SEQUENCE LISTING
PATENT APPLICATION: US/09/983,000 

Input Set : A:\es.txt 

Output Set: N:\CRF3\11022001\l983000.raw 



DATE: 11/02/2001 
TIME: 15:30:46 



Does Not Comply 
Corrected Diskette Needed 



3 <110> APPLICANT: AGY Therapeutics 

4 Melcher, Thorsten 

5 Mueller, Sabine

6 Chin, Daniel

8 <120> TITLE OF INVENTION: USE OF PROTEIN BIOMOLECULAR TARGETS IN THE TREATMENT AND 
VISUALIZATION OF 

9 BRAIN TUMORS 

11 <130> FILE REFERENCE: 263/180 -- PEagleman -- AGY 

C--> 13 <140> CURRENT APPLICATION NUMBER: US/09/983,000 

C--> 13 <141> CURRENT FILING DATE: 2001-10-17 

13 <160> NUMBER OF SEQ ID NOS : 35 

15 <170> SOFTWARE: PatentIn version 3.1



ERRORED SEQUENCES 




The type of errors shown exist throughout
the Sequence Listing. Please check subsequent 
sequences for similar errors. 




<210> SEQ ID NO: 23 

<211> LENGTH: 3143 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<220> FEATURE: 

<221> NAME/KEY: REFERENCE 

<222> LOCATION: (1)..(3143)



<223> OTHER INFORMATION: Crosby, A.H.; Edwards, S.J., Murray, J.C. and Dixon,

Genomic organization of the human osteopontin gene: exclusion of
the locus from a causative role in the pathogenesis of
dentinogenesis imperfecta type II JOURNAL Genomics 27 (1), 155-160 (1995)



6224 <220> FEATURE: 

6225 <221> NAME/KEY: Gene 

6226 <222> LOCATION: (1)..(3143) 

6227 <223> OTHER INFORMATION: Osteopontin 
6230 <220> FEATURE: 

6231 <221> NAME/KEY: CDS

6232 <222> LOCATION: (374)..(427)

6233 <223> OTHER INFORMATION:

6236 <220> FEATURE:

6237 <221> NAME/KEY: CDS

6238 <222> LOCATION: (537)..(575)

6239 <223> OTHER INFORMATION:

6242 <220> FEATURE:

6243 <221> NAME/KEY: CDS

6244 <222> LOCATION: (884)..(964)

6245 <223> OTHER INFORMATION:

6248 <220> FEATURE:

6249 <221> NAME/KEY: CDS

6250 <222> LOCATION: (1232)..(1273)

6251 <223> OTHER INFORMATION:

6254 <220> FEATURE:

6255 <221> NAME/KEY: CDS

6256 <222> LOCATION: (1597)..(1920)

6257 <223> OTHER INFORMATION:

6260 <220> FEATURE:

6261 <221> NAME/KEY: CDS

6262 <222> LOCATION: (2305)..(2709)

6263 <223> OTHER INFORMATION:

E--> 6266 <400> SEQUENCE: 33

6267 ggggaagtgt gggagcaggt gggctgggca gtggcagaaa cctgatgaca caatctcgcc 60 

6269 gcctccctgt gttggtggag gatgtctgca gcagcattta aattctggga gggcttggtt 120 

6271 gtcagcagca gcaggaggag gcagagacag catcgtcggg accagactcg tctcaggcca 180 

6273 gttgcagcct tctcagccaa acgccgacca aggtacagct tcagtttgct actgggttgt 240 

6275 gcattcagct gaatttcatg gggaagtcca aattctaagg aaaaaaatgt ggtagtataa 300 

6277 aaaggtatca ctgttgtaac ctatgaagat gtcagctatt cctttgaaat attttgcagg 360 

6279 aaaactcact acc atg aga att gca gtg att tgc ttt tgc ctc cta ggc 409

6280 Met Arg Ile Ala Val Ile Cys Phe Cys Leu Leu Gly

6281 1 5 10

6283 atc acc tgt gcc ata cca gtgagtacag ttgcatctta aagaaaattc 457

6284 Ile Thr Cys Ala Ile Pro

6285 15

6287 ctgaaaataa ctgaattgtg tgcttccatg tgctaggagg acattcttgt aatctttctt 517

6289 catcttttct gtttctaag gtt aaa cag gct gat tct gga agt tct gag gaa 569

6290 Val Lys Gln Ala Asp Ser Gly Ser Ser Glu Glu

6291 20 25

6293 aag cag gtaagcatct tttatgtttt tatatagtta aatcatttac tcaattatgg 625

6294 Lys Gln

6295 30

6297 cgagaggtgc aagaaacgta tttgctgcga tcaaatgagt tcatatttgt aaagcaattt 685
6299 gaaagagtgc ctagcccaca gtaagtgcta cataagagtt tgttaaatga atctgcaaaa 745
6301 aaaaaaaaaa ttacaaaaag gtacctaagg gtccgggtga ctatatgctt ccatcaagac 805
6303 tagtgaagaa tggttgtttt ttccattcat ccctacattt ctttttttaa taatgataaa 865

6305 catgcaactt ttttgtag ctt tac aac aaa tac cca gat gct gtg gcc aca 916

6306 Leu Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr

6307 35 40

6309 tgg cta aac cct gac cca tct cag aag cag aat ctc cta gcc cca cag 964

6310 Trp Leu Asn Pro Asp Pro Ser Gln Lys Gln Asn Leu Leu Ala Pro Gln

6311 45 50 55

6313 gtatttttaa acttctcata attaaactac agtgatgaaa gatagccaca ctcaggccat 1024
6315 ttgggctgct cagatgaatc ctgccctgcc tgctggcaaa catgtgctta ggacattgac 1084
6317 tgatctgcca tgttggcttc tctctgtgtt aagccatcca cagatgaggc tgaaaaataa 1144
6319 aaactgcttt ggattaaaaa ggttaacttt tgaataaaaa agctaggcat gtgtgatgcg 1204

6321 cactaacacg tgccattcct tcttcag aat gct gtg tcc tct gaa gaa acc aat 1258

6322 Asn Ala Val Ser Ser Glu Glu Thr Asn

6323 60 65

6325 gac ttt aaa caa gag gtaagttctc attttcaatc agaggcccat catgccttga 1313

6326 Asp Phe Lys Gln Glu

6327 70

6329 agagatgaaa gaaggcattg cctggattct cttctgatga aatttcatta gcaagttttc 1373
6331 cagctaattg gcagtctaaa acttgctcat aaataaaaca tgtatttact aaatatcaga 1433
6333 aatactaggt ttcctcggat aacctaaaag ccatggtatg tactgtgaat gcaaagattc 1493
6335 tgaaactaaa taaaaagaaa gatagtaaaa gactaatgtg ctataaaggc taagggaaaa 1553

6337 taaaaaccca tatattaatt ttcccggcca tcttaatttt cag acc ctt cca agt 1608

6338 Thr Leu Pro Ser

6339 75

6341 aag tcc aac gaa agc cat gac cac atg gat gat atg gat gat gaa gat 1656

6342 Lys Ser Asn Glu Ser His Asp His Met Asp Asp Met Asp Asp Glu Asp

6343 80 85 90

6345 gat gat gac cat gtg gac agc cag gac tcc att gac tcg aac gac tct 1704

6346 Asp Asp Asp His Val Asp Ser Gln Asp Ser Ile Asp Ser Asn Asp Ser

6347 95 100 105

6349 gat gat gta gat gac act gat gat tct cac cag tct gat gag tct cac 1752

6350 Asp Asp Val Asp Asp Thr Asp Asp Ser His Gln Ser Asp Glu Ser His

6351 110 115 120

6353 cat tct gat gaa tct gat gaa ctg gtc act gat ttt ccc acg gac ctg 1800

6354 His Ser Asp Glu Ser Asp Glu Leu Val Thr Asp Phe Pro Thr Asp Leu

6355 125 130 135 140

6357 cca gca acc gaa gtt ttc act cca gtt gtc ccc aca gta gac aca tat 1848

6358 Pro Ala Thr Glu Val Phe Thr Pro Val Val Pro Thr Val Asp Thr Tyr

6359 145 150 155

6361 gat ggc cga ggt gat agt gtg gtt tat gga ctg agg tca aaa tct aag 1896

6362 Asp Gly Arg Gly Asp Ser Val Val Tyr Gly Leu Arg Ser Lys Ser Lys

6363 160 165 170

6365 aag ttt cgc aga cct gac atc cag gtaaatcctt taacagacac acctgatggt 1950

6366 Lys Phe Arg Arg Pro Asp Ile Gln

6367 175 180

6369 tctgactagc gctcaagtct aggaaaccac agtttgcata ttcattcatt cattcatcca 2010
6371 ttcattcatc cattcagcaa gaattcattc atattctact ttatgaccat tgaatacaaa 2070
6373 tctttttctg cttggcggtt tttgtaagtc tacataattt ctctctagat ttgattctca 2130
6375 aacacaattc tactttttga aatcctggat caaagtaaca tgctagtatt atttcagcca 2190
6377 gatttagaca atttttagta taagatgacc taaaagctag agagtggaaa aggattacca 2250

6379 tattcccatc cctagccgtt catataatta ttcttcattt gtgccgtgat tcag tac 2307

6380 Tyr

6383 cct gat gct aca gac gag gac atc acc tca cac atg gaa agc gag gag 2355

6384 Pro Asp Ala Thr Asp Glu Asp Ile Thr Ser His Met Glu Ser Glu Glu

6385 185 190 195

6387 ttg aat ggt gca tac aag gcc atc ccc gtt gcc cag gac ctg aac gcg 2403

6388 Leu Asn Gly Ala Tyr Lys Ala Ile Pro Val Ala Gln Asp Leu Asn Ala

6389 200 205 210

6391 cct tct gat tgg gac agc cgt ggg aag gac agt tat gaa acg agt cag 2451

6392 Pro Ser Asp Trp Asp Ser Arg Gly Lys Asp Ser Tyr Glu Thr Ser Gln

6393 215 220 225

6395 ctg gat gac cag agt gct gaa acc cac agc cac aag cag tcc aga tta 2499

6396 Leu Asp Asp Gln Ser Ala Glu Thr His Ser His Lys Gln Ser Arg Leu

6397 230 235 240 245

6399 tat aag cgg aaa gcc aat gat gag agc aat gag cat tcc gat gtg att 2547

6400 Tyr Lys Arg Lys Ala Asn Asp Glu Ser Asn Glu His Ser Asp Val Ile

6401 250 255 260

6403 gat agt cag gaa ctt tcc aaa gtc agc cgt gaa ttc cac agc cat gaa 2595

6404 Asp Ser Gln Glu Leu Ser Lys Val Ser Arg Glu Phe His Ser His Glu

6405 265 270 275

6407 ttt cac agc cat gaa gat atg ctg gtt gta gac ccc aaa agt aag gaa 2643

6408 Phe His Ser His Glu Asp Met Leu Val Val Asp Pro Lys Ser Lys Glu

6409 280 285 290

6411 gaa gat aaa cac ctg aaa ttt cgt att tct cat gaa tta gat agt gca 2691

6412 Glu Asp Lys His Leu Lys Phe Arg Ile Ser His Glu Leu Asp Ser Ala

6413 295 300 305

6415 tct tct gag gtc aat taa aaggagaaaa aatacaattt ctcactttgc 2739

6416 Ser Ser Glu Val Asn

6417 310

6419 atttagtcaa aagaaaaaat gctttatagc aaaatgaaag agaacatgaa atgcttcttt 2799
6421 ctcagtttat tggttgaatg tgtatctatt tgagtctgga aataactaat gtgtttgata 2859
6423 attagtttag tttgtggctt catggaaact ccctgtaaac aaaagcttca gggttatgtc 2919
6425 tatgttcatt ctatagaaga aatgcaaact atcactgtat tttaatattt gttattctct 2979
6427 catgaataga aatttatgta gaagcaaaca aaatactttt acccacttaa aaagagaata 3039
6429 taacatttta tgtcactata atcttttgtt ttttaagtta gtgtatattt tgttgtgatt 3099
6431 atcttttgtg gtgtgaataa atcttttatc ttgaatgtaa taag 3143
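
The CDS locations and codon lines of SEQ ID NO: 23 can be cross-checked mechanically. The following is a minimal Python sketch, not part of the original report: it sums the inclusive `<222>` CDS ranges to confirm that the coding length is a whole number of codons, and translates one codon line (listing line 6411) with the standard genetic code for comparison with the residue line beneath it. The names `cds_length` and `translate` are illustrative, and the use of the standard genetic code is an assumption.

```python
# Standard genetic code, with codons enumerated in t-c-a-g order (assumption:
# the listing uses the standard code; the report itself does not say so).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon marker, chosen here for illustration
}

# CDS ranges copied from the <222> LOCATION fields of SEQ ID NO: 23.
SEGMENTS = [(374, 427), (537, 575), (884, 964),
            (1232, 1273), (1597, 1920), (2305, 2709)]


def cds_length(segments):
    """Total number of bases covered by inclusive (start, end) ranges."""
    return sum(end - start + 1 for start, end in segments)


def translate(codons):
    """Translate a whitespace-separated codon string to three-letter codes."""
    return [ONE_TO_THREE[CODON_TABLE[c.lower()]] for c in codons.split()]


if __name__ == "__main__":
    total = cds_length(SEGMENTS)
    print("coding bases:", total, "remainder mod 3:", total % 3)
    line_6411 = "gaa gat aaa cac ctg aaa ttt cgt att tct cat gaa tta gat agt gca"
    print(" ".join(translate(line_6411)))
```

The six ranges cover 945 bases, that is 315 codons, which agrees with the 314 numbered residues plus the `taa` stop codon on line 6415.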

6569 <210> SEQ ID NO: 25 

6570 <211> LENGTH: 259 

6571 <212> TYPE: PRT 

6572 <213> ORGANISM: Homo sapiens 

6574 <220> FEATURE: 

6575 <221> NAME/KEY: Gene 

6576 <222> LOCATION: (1)..(259) 

6577 <223> OTHER INFORMATION: Carbonic Anhydrase domain of human carbonic anhydrase III



6582 Ala Lys Glu Trp Gly Tyr Ala Ser His Asn Gly Pro Asp His Trp His

6583 1 5 10 15

6586 Glu Leu Phe Pro Asn Ala Lys Gly Glu Asn Gln Ser Pro Ile Glu Leu

6587 20 25 30

6590 His Thr Lys Asp Ile Arg His Asp Pro Ser Leu Gln Pro Trp Ser Val

6591 35 40 45

6594 Ser Tyr Asp Gly Gly Ser Ala Lys Thr Ile Leu Asn Asn Gly Lys Thr

6595 50 55 60

6598 Cys Arg Val Val Phe Asp Asp Thr Tyr Asp Arg Ser Met Leu Arg Gly

6599 65 70 75 80

6602 Gly Pro Leu Pro Gly Pro Tyr Arg Leu Arg Gln Phe His Leu His Trp

6603 85 90 95

6606 Gly Ser Ser Asp Asp His Gly Ser Glu His Thr Val Asp Gly Val Lys

6607 100 105 110

6610 Tyr Ala Ala Glu Leu His Leu Val His Trp Asn Pro Lys Tyr Asn Thr

6611 115 120 125

6614 Phe Lys Glu Ala Leu Lys Gln Arg Asp Gly Ile Ala Val Ile Gly Ile

6615 130 135 140

6618 Phe Leu Lys Ile Gly His Glu Asn Gly Glu Phe Gln Ile Phe Leu Asp

6619 145 150 155 160

6622 Ala Leu Asp Lys Ile Lys Thr Lys Gly Lys Glu Ala Pro Phe Thr Lys

6623 165 170 175

6626 Phe Asp Pro Ser Cys Leu Phe Pro Ala Cys Arg Asp Tyr Trp Thr Tyr

6627 180 185 190

6630 Gln Gly Ser Phe Thr Thr Pro Pro Cys Glu Glu Cys Ile Val Trp Leu

6631 195 200 205

6634 Leu Leu Lys Glu Pro Met Thr Val Ser Ser Asp Gln Met Ala Lys Leu

6635 210 215 220

6638 Arg Ser Leu Leu Ser Ser Ala Glu Asn Glu Pro Pro Val Pro Leu Val

6639 225 230 235 240

6642 Ser Asn Trp Arg Pro Pro Gln Pro Ile Asn Asn Arg Val Val Arg Ala

6643 245 250 255

6646 Ser Phe Lys
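
A protein listing such as SEQ ID NO: 25 can be screened for malformed residue codes and for agreement with the declared `<211>` length. The sketch below is an illustration under assumptions, not the checker that produced this report: the accepted code set (the twenty standard residues plus `Asx`, `Glx` and `Xaa`) and the function name `check_residues` are choices made here.

```python
# Three-letter residue codes accepted by this sketch (an assumed set:
# the twenty standard amino acids plus the ambiguity codes Asx, Glx, Xaa).
VALID_RESIDUES = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
    "Asx", "Glx", "Xaa",
}


def check_residues(text, declared_length=None):
    """Return (bad, length_ok) for a whitespace-separated residue string.

    bad is a list of (1-based position, token) pairs for unknown codes;
    length_ok is True when no length was declared or the token count
    matches it.
    """
    tokens = text.split()
    bad = [(pos, tok) for pos, tok in enumerate(tokens, start=1)
           if tok not in VALID_RESIDUES]
    length_ok = declared_length is None or len(tokens) == declared_length
    return bad, length_ok


if __name__ == "__main__":
    # First residue line of SEQ ID NO: 25 (listing line 6582).
    line = "Ala Lys Glu Trp Gly Tyr Ala Ser His Asn Gly Pro Asp His Trp His"
    print(check_residues(line, declared_length=16))
    # Misread codes such as "Gin" or "He" are reported with their positions.
    print(check_residues("Asp Ser Gin Glu He"))
```

Run over the whole sequence with `declared_length=259`, the same call would confirm whether the residue count matches the `<211>` field.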
VERIFICATION SUMMARY

PATENT APPLICATION: US/09/983,000 



DATE: 11/02/2001 
TIME: 15:30:48 



Input Set : A:\es.txt

Output Set: N:\CRF3\11022001\I983000.raw 

L:13 M:270 C: Current Application Number differs, Replaced Current Application No 

L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:1900 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:6

L:1915 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:6

L:2762 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:7

L:2768 M:259 W: Allowed number of lines exceeded, <223> Other Information:

L:3408 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:11

L:4386 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:13

L:4392 M:259 W: Allowed number of lines exceeded, <223> Other Information:

L:4786 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:15

L:5392 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:17

L:5644 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:19 

L:5650 M:259 W: Allowed number of lines exceeded, <223> Other Information: 

L:6030 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:21 

L:6036 M:259 W: Allowed number of lines exceeded, <223> Other Information: 

L:6216 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:23 

L:6266 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS:23 differs: 33 

L:6580 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS:25 differs: 41 
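
Each entry in the verification summary follows the pattern `L:<line> M:<message number> <letter>: <text>`. The sketch below, which is an illustration and not the Office's own tooling, parses such entries into records and counts them by letter code. The reading of `C`, `W` and `E` as corrected, warning and error is an assumption based on the wording of the entries; the function names are hypothetical.

```python
import re
from collections import Counter

# One summary entry: line number, message number, one-letter code, free text.
ENTRY_RE = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")


def parse_summary(text):
    """Return a list of dicts for every line that matches the entry pattern."""
    records = []
    for raw in text.splitlines():
        match = ENTRY_RE.match(raw.strip())
        if match:
            records.append({
                "line": int(match.group(1)),
                "message": int(match.group(2)),
                "severity": match.group(3),
                "text": match.group(4).strip(),
            })
    return records


def count_by_severity(records):
    """Count records per one-letter code, returned as a plain dict."""
    return dict(Counter(rec["severity"] for rec in records))


if __name__ == "__main__":
    sample = "\n".join([
        "L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date",
        "L:2768 M:259 W: Allowed number of lines exceeded, <223> Other Information:",
        "L:6266 M:212 E: (34) Invalid or duplicate Sequence ID Number, "
        "SEQUENCE ID NOS:23 differs: 33",
    ])
    for rec in parse_summary(sample):
        print(rec["line"], rec["message"], rec["severity"], rec["text"])
    print(count_by_severity(parse_summary(sample)))
```

Grouping by message number instead of letter code would show at a glance that message 257 (feature value mis-spelled or invalid) accounts for most of the warnings in this report.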
<110> Smith, John; Smithgene Inc.

<120> Example of a Sequence Listing

<130> 01-00001

<140> PCT/EP98/00001

<141> 1998-12-31

<150> US 08/999,999

<151> 1997-10-15

<160> 4

<170> PatentIn version 2.0

<210> 1

<211> 389

<212> DNA

<213> Paramecium sp.

<220>

<221> CDS

<222> (279)...(389)

<300>

<301> Doe, Richard

<302> Isolation and Characterization of a Gene Encoding a
Protease from Paramecium sp.

<303> Journal of Genes

<304> 1

<305> 4

<306> 1-7

<307> 1988-06-31

<308> 123456

<309> 1988-06-31

<400> 1

agctgtagtc attcctgtgr cctctuctct ctgggcttct caccctgcta atcagatctc
agggagagtg tcttgaccct cctctgcctt tgcagcttca caggcaggoa ggcaggcagc
tgatgtggca attgctggca gtgccacagg cttttcagcc aggcttaggg tgggttccgc

cgcggcgcgg cggcccctct cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc

PCI PI/EX 1X722/6
Annex VII, page 30

Appendix 3, page 2

goacctoatt aggtgagcag gaggaggggg cagtcagc atg gtt tca atg ttc agc 296

Met Val Ser Met Phe Ser

1 5

ctg tct ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgt ttg ttc caa

Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gln

10 15 20

tgt ccc aaa gtc ctc ccc tgt cac tca tca ctg cag ccg aat ctt 389

Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gln Pro Asn Leu
25 30 35

<210> 2

<211> 37

<212> PRT

<213> Paramecium sp.

<400> 2

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu
1 5 10 15

Phe Val Cys Leu Phe Gln Cys Pro Lys Val Leu Pro Cys His Ser Ser
20 25 30

Leu Gln Pro Asn Leu

<210> 3

<211> 11

<212> PRT

<213> Artificial Sequence

<220>
<223> Designed peptide based on size and polarity to act as a
linker between the alpha and beta chains of Protein XYZ.

<400> 3

Met Val Asn Leu Glu Pro Met His Thr Glu Ile
1 5 10

<210> 4
<400> 4
000

[Annex VIII follows]
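
Because every item in the specimen listing above starts on its own line with a numeric identifier in angle brackets, the header fields can be pulled out with a few lines of code. The following Python sketch is illustrative only: `parse_fields` and `missing_mandatory` are hypothetical names, and the default set of required identifiers is an assumption limited to items this document marks with an unconditional "M".

```python
import re

# A field line: three-digit numeric identifier in angle brackets, then a value.
FIELD_RE = re.compile(r"^<(\d{3})>\s*(.*)$")


def parse_fields(text):
    """Return (identifier, value) pairs for every line that starts a field."""
    fields = []
    for raw in text.splitlines():
        match = FIELD_RE.match(raw.strip())
        if match:
            fields.append((match.group(1), match.group(2).strip()))
    return fields


def missing_mandatory(fields, required=("110", "120", "211", "212", "400")):
    """List required identifiers that never occur (assumed default set)."""
    present = {ident for ident, _ in fields}
    return [ident for ident in required if ident not in present]


if __name__ == "__main__":
    listing = "\n".join([
        "<110> Smith, John; Smithgene Inc.",
        "<120> Example of a Sequence Listing",
        "<210> 3",
        "<211> 11",
        "<212> PRT",
        "<213> Artificial Sequence",
        "<400> 3",
        "Met Val Asn Leu Glu Pro Met His Thr Glu Ile",
    ])
    fields = parse_fields(listing)
    print(fields)
    print("missing:", missing_mandatory(fields))
```

Continuation lines and sequence data do not begin with an identifier, so this sketch skips them; a fuller parser would attach them to the preceding field.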
identifiers and their accompanying information as shown in the following
table. The numeric identifier shall be used only in the "Sequence
Listing." The order and presentation of the items of information in the
"Sequence Listing" shall conform to the arrangement given below. Each
item of information shall begin on a new line and shall begin with the
numeric identifier enclosed in angle brackets as shown. The submission
of those items of information designated with an "M" is mandatory. The
submission of those items of information designated with an "O" is
optional. Numeric identifiers <110> through <170> shall only be set
forth at the beginning of the "Sequence Listing." The following table
illustrates the numeric identifiers.



Numeric     Definition              Comments and Format                   Mandatory (M) or
Identifier                                                                Optional (O)

<110>       Applicant               Preferably max. of 10 names; one      M
                                    name per line; preferable format:
                                    Surname, Other Names and/or
                                    Initials

<120>       Title of Invention                                            M

<130>       File Reference          Personal file reference               M when filed prior to
                                                                          assignment of appl. number

<140>       Current Application     Specify as: US 07/999,999 or          M, if available
            Number                  PCT/US96/99999

<141>       Current Filing Date     Specify as: yyyy-mm-dd                M, if available

<150>       Prior Application       Specify as: US 07/999,999 or          M, if applicable include
            Number                  PCT/US96/99999                        priority documents under
                                                                          35 USC 119 and 120

<151>       Prior Application       Specify as: yyyy-mm-dd                M, if applicable
            Filing Date

<160>       Number of SEQ ID NOs    Count includes total number of        M
                                    SEQ ID NOs

<170>       Software                Name of software used to create       O
                                    the Sequence Listing

<210>       SEQ ID NO:#:            Response shall be an integer          M
                                    representing the SEQ ID NO shown

<211>       Length                  Respond with an integer expressing    M
                                    the number of bases or amino acid
                                    residues

<212>       Type                    Whether presented sequence molecule   M
                                    is DNA, RNA, or PRT (protein). If a
                                    nucleotide sequence contains both
                                    DNA and RNA fragments, the type
                                    shall be "DNA." In addition, the
                                    combined DNA/RNA molecule shall be
                                    further described in the <220> to
                                    <223> feature section.

<213>       Organism                Scientific name, i.e. Genus/species,  M
                                    Unknown or Artificial Sequence. In
                                    addition, the "Unknown" or
                                    "Artificial Sequence" organisms
                                    shall be further described in the
                                    <220> to <223> feature section.

<220>       Feature                 Leave blank after <220>. <221-223>    M, under the following
                                    provide for a description of points   conditions: if "n," "Xaa," or a
                                    of biological significance in the     modified or unusual L-amino acid
                                    sequence.                             or modified base was used in a
                                                                          sequence; if ORGANISM is
                                                                          "Artificial Sequence" or
                                                                          "Unknown"; if molecule is
                                                                          combined DNA/RNA.

<221>       Name/Key                Provide appropriate identifier for    M, under the following
                                    feature, preferably from WIPO         conditions: if "n," "Xaa," or a
                                    Standard ST.25 (1998), Appendix 2,    modified or unusual L-amino acid
                                    Tables 5 and 6                        or modified base was used in a
                                                                          sequence

<222>       Location                Specify location within sequence;     M, under the following
                                    where appropriate state number of     conditions: if "n," "Xaa," or a
                                    first and last bases/amino acids in   modified or unusual L-amino acid
                                    feature                               or modified base was used in a
                                                                          sequence

<223>       Other Information       Other relevant information; four      M, under the following
                                    lines maximum                         conditions: if "n," "Xaa," or a
                                                                          modified or unusual L-amino acid
                                                                          or modified base was used in a
                                                                          sequence; if ORGANISM is
                                                                          "Artificial Sequence" or
                                                                          "Unknown"; if molecule is
                                                                          combined DNA/RNA.

<300>       Publication             Leave blank after <300>               O
            Information

<301>       Authors                 Preferably max of ten named authors   O
                                    of publication; specify one name
                                    per line; preferable format:
                                    Surname, Other Names and/or
                                    Initials

<302>       Title                                                         O

<303>       Journal                                                       O

<304>       Volume                                                        O

<305>       Issue                                                         O

<306>       Pages                                                         O

<307>       Date                    Journal date on which data            O
                                    published; specify as yyyy-mm-dd,
                                    MMM-yyyy or Season-yyyy

<308>       Database Accession      Accession number assigned by          O
            Number                  database including database name

<309>       Database Entry Date     Date of entry in database; specify    O
                                    as yyyy-mm-dd or MMM-yyyy

<310>       Patent Document         Document number; for patent-type      O
            Number                  citations only. Specify as, for
                                    example, US 07/999,999

<311>       Patent Filing Date      Document filing date, for             O
                                    patent-type citations only;
                                    specify as yyyy-mm-dd

<312>       Publication Date        Document publication date, for        O
                                    patent-type citations only;
                                    specify as yyyy-mm-dd

<313>       Relevant Residues       FROM (position) TO (position)         O

<400>       Sequence                SEQ ID NO should follow the numeric   M
                                    identifier and should appear on the
                                    line preceding the actual sequence

5. Section 1.824 is revised to read as follows:

1.824 Form and format for nucleotide and/or amino acid sequence
submissions in computer readable form.

(a) The computer readable form required by 1.821(e) shall meet the
following specifications:



(1) The computer readable form shall contain a single "Sequence Listing"
as either a diskette, series of diskettes, or other permissible media
outlined in paragraph (c) of this section.

(2) The "Sequence Listing" in paragraph (a)(1) of this section shall be
submitted in American Standard Code for Information Interchange (ASCII)
text. No other formats shall be allowed.

(3) The computer readable form may be created by any means, such as word
processors, nucleotide/amino acid sequence editors or other custom
computer programs; however, it shall conform to all specifications
detailed in this section.

(4) File compression is acceptable when using diskette media, so long as
the compressed file is in a self-extracting format that will decompress
on one of the systems described in paragraph (b) of this section.

(5) Page numbering shall not appear within the computer readable form
version of the "Sequence Listing" file.

(6) All computer readable forms shall have a label permanently affixed
thereto on which has been hand-printed or typed: the name of the
applicant, the title of the invention, the date on which the data were
recorded on the computer readable form, the operating system used, a
reference number, and an application serial number and filing date, if
known.
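
Item (2) above requires the "Sequence Listing" to be plain ASCII text. A submitter could screen a file for stray non-ASCII bytes before recording it; the sketch below shows one way to do that in Python. It is an illustration only, and the function name `non_ascii_positions` and the sample file name are hypothetical.

```python
def non_ascii_positions(data: bytes):
    """Return (line, column, byte value) for every byte above 0x7F.

    Lines and columns are 1-based; lines are split on the newline byte.
    """
    hits = []
    for lineno, line in enumerate(data.split(b"\n"), start=1):
        for col, byte in enumerate(line, start=1):
            if byte > 0x7F:
                hits.append((lineno, col, byte))
    return hits


if __name__ == "__main__":
    # "seqlist.txt" is a placeholder path, not a file named in this document.
    try:
        with open("seqlist.txt", "rb") as handle:
            problems = non_ascii_positions(handle.read())
    except FileNotFoundError:
        problems = non_ascii_positions(b"<210> 1\n<212> DNA\n")
    print("non-ASCII bytes found:", problems)
```

An empty result means every byte is within the 7-bit ASCII range; it does not check the other requirements of this section, such as the absence of page numbering.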
(b) Computer readable form submissions must meet these format
requirements:

(1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh;

(2) Operating System: MS-DOS, Unix or Macintosh;